SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: KABUSHIKI KAISHA HAYASHIBARA SEIBUTSU KAGAKU 
KENKYUJO 



(ii) TITLE OF INVENTION: POLYPEPTIDE HAVING /3- FRUCTOFURANOSIDASE 
ACTIVITY 



(iii) NUMBER OF SEQUENCES: 6 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BROWDY AND NEIMARK 

(B) STREET: 419 Seventh Street, N.W. , Suite 300 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20004 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/870,827 

(B) FILING DATE: 06-JUN-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 170,630/1996 

(B) FILING DATE: 10-JUN-1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: BROWDY, Roger L. 

(B) REGISTRATION NUMBER: 25,618 

(C) REFERENCE/DOCKET NUMBER: TSUSAKI=2 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-628-5197 

(B) TELEFAX: 202-737-3528 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(iii) Fragment type: N- terminal fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Asn Ser Gly Asp Tyr Lys Glu Asp Tyr Gly Phe Ala His lie Thr 
15 10 15 

Arg Ala Asp Met Leu 
20 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(iii) Fragment type: internal fragment 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Ser Val Phe Asp Gly Gly Asp Gly Thr Val Tyr Gin 
15 10 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 





(xi) SEQUENCE DESCRIPTION: 


SEQ 


ID N0:3: 












Met 


Asn Ser Gly Asp 


Tvr 


Lys 


Glu Asp 


Tyr 


Gly Phe 


Ala 


His 


- 




1 








5 










10 












Phe 


Arg 


Ala Asp 


Met 


Leu 


Lvs 


He 


Pro Gly 


Gin 


Gin 


Asn 


Ser 


Pro 


Gin 




20 








25 








30 




Ala 


Lys 


Val 


Pro 


Gin 


Phe 


Asn 


Ala 


Ser 


Ala 


He 


Lys 


Asn 


He 


Asp 


Ser 




35 










40 










45 








Lys 


Gly Tyr 


Asp 


Lys 


Ser 


Gly 


Asn 


Leu 


He 


Asp 


Leu 


Asp 


Val 


Trp 


Asp 


50 




















60 








Gly 


Ser 


Trp 


Pro 


Leu 


Gin 


Asn 


Ala 


Asp 


Gly 


Thr 


Ala 


Ala 


Asn 


Tyr 


His 


65 








70 










75 










80 


Tyr 


His 


He 


Val 


Ser 


Ala 


Leu 


Ala Gly 


Asp 


Pro 


Lys 


Asn 


Ser 


Asp 


Asp 








85 










90 










95 




Thr 


Pro 


Leu 


His 


Leu 


Phe 


Tvr 


Gin 


Lys 


Val 


Gly Asp 


Thr 


Ser 


He 


Asp 








100 










105 










110 






Ser 


Trp 


Lys 


Asn 


Ala 


Gly 


Ara 


Val 


Phe 


Glu 


Asp 


Met 


Asp 


Lys 


Phe 


Val 




115 










120 










125 








Pro 


Asn Asp 


Pro 


Tyr 


Leu 


Lys 


Tyr 


Gin 


Thr 


Gin 


Glu 


Trp 


Ser 


Gly 


Ser 




130 










135 










140 










Ala 


Thr 


Leu 


Thr 


Lys 


Asp 


Gly 


Gin 


Val 


Arg 


Leu 


Phe 


Tyr 


Thr 


Asp 


Tyr 


145 








150 










155 








He 


160 


Ser 


Gly Asn 


Pro 


Glu 


Asp 


Gly 


Gly Thr 


Gly 


Ala Gly 


Asn 


Gin 


He 










165 










170 










175 




Ser 


Thr 


Ala 


Gin 




Asn 


Leu 


Ser 


Gin 


Pro 


Asp 


Ala 


Ala 


Thr 


Leu 


Lys 








18 0 










185 










190 






Val 


Asp 


Gly 


Val 


Ser 


Asp 


His 


Lys 


Ser 


Val 


Phe 


Asp 


Gly Gly 


Asp Gly 




195 










200 










205 








Thr 


Val 


Tyr 


Gin 


Asn 


He 


Gin 


Gin 


Phe 


He 


Asp 


Glu 


Gly Lys 


Trp 


He 




210 








215 










220 










Ser 


Gly Asp 


Asn 


His 


Thr 


Leu 


Arg 


Asp 


Pro 


His 


Tyr 


Val 


Glu 


Asp 


Lys 


225 










230 










235 










240 


Gly 


His 


Lys 


Tyr 


Leu 


Val 


Phe 


Glu 


Ala 


Asn 


Thr 


Gly 


Thr 


Thr 


Asp 


Gly 




245 










250 










255 




Tyr 


Gin 


Gly 


Asp 


Gin 


Ser 


Phe 


Asn 


Asn 


Lys 


Ala 


Tyr 


Tyr Gly 


Gly 


Ser 




260 










265 










270 






Asp 


Val 


Phe 


Phe 


Gin 


Asn 


Glu 


Lys 


Asn 


Lys 


Leu 


Leu 


Gin 


Ser 


Pro 


Lys 




275 










280 










285 








Lys 


Gin 


He 


Ala 


Ser 


Leu 


Ala 


Asn 


Gly 


Ala 


Leu Gly 


He 


Val 


Glu 


Leu 


290 










295 










300 










Ala 


Asp 


Asp 


Tyr 


Thr 


Val 


Lys 


Ser 


Val 


Met 


Lys 


Pro 


Leu 


Val 


Ala 


Ser 


305 




310 










315 










320 


Asn 


Thr 


Val 


Ala 


Asp 


Glu 


Val 


Glu 


Arg 


Ala 


Asn 


He 


Phe 


Lys 


Met 


Asn 










325 










330 










335 




Asn 


Lys 


Trp 


Tyr 


Leu 


Phe 


Thr 


Asp 


Ser 


Arg 


Gly 


Ser 


Lys 


Met 


Thr 


Ser 




340 










345 










350 






Asp 


Gly 


He 


Asn 


Asp 


Lys 


Asp 


Val 


Tyr 


Met 


Leu 


Gly 


Pro 


Gly 


Gly Asp 


355 










360 










365 








Ser 


Leu 


Asn 


Gly 


Pro 


His 


Asn 


Pro 


He 


Asn 


Glu 


Thr 


Gly 


Leu 


Val 


Leu 




370 








375 










380 











- 36 - 



Asn 


Met Asn 


Leu 


Asp 


Pro 


Ala 


Asp 


Leu 


Thr 


His 


Thr 


Tyr Ser His 


Cys 


385 






390 










395 






400 


Gly 


lie Pro 


His 


Pro 


Glu 


Gly Asn 


Asn 


Val 


Val 


Leu 


Thr Ser Tyr 


Met 






405 










410 






415 




Thr 


Asn Arg 


Gly 


Plie 


Tyr 


Pro 


Glu 


His 


His 


Ser 


His 


Leu Arg Asp 


Lys 




420 










425 








430 




Leu 


Gly Val 


Asn 


He 


Lys 


Gly 


Ser 


Asp 


Thr Ser Gly Gly Glu Asn 


Ser 




435 










440 










445 




Ser 


Gly Gin Gly 


Gin 


Phe 


Pro 


















450 








455 

















(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

ATG AAC AGC GGG GAC TAC AAG GAA GAC TAT GGT TTT GCC CAT ATT ACA 
Met Asn Ser Gly Asp Tyr Lys Glu Asp Tyr Gly Phe Ala His He Thr 
1.5 10 15 

CGC GCT GAC ATG CTA AAA ATT CCA GGA CAA CAA AAC AGT CCT CAA TTT 
Arg Ala Asp Met Leu Lys He Pro Gly Gin Gin Ash Ser Pro Gin Phe 
20 25 30 

AAA GTG CCT CAA TTC AAT GCA TCA GCA ATC AAA AAC ATT GAT TCG GCA 
Lys Val Pro Gin Phe Asn Ala Ser Ala He Lys Asn He Asp Ser Ala 
35 40 45 

AAA GGG TAT GAT AAG TCA GGC AAC TTA ATA GAT TTA GAT GTA TGG GAT 
Lys Gly Tyr Asp Lys Ser Gly Asn Leu He Asp Leu Asp Val Trp Asp 
50 55 60 

AGC TGG CCA CTG CAA AAC GCT GAT GGT ACT GCG GCA AAT TAT CAT GGA 
Ser Trp Pro Leu Gin Asn Ala Asp Gly Thr Ala Ala Asn Tyr His Gly 
65 70 75 80 

TAT CAC ATC GTC TCC GCT TTA GCA GGT GAC CCA AAA AAC AGT GAT GAT 
Tyr His He Val Ser Ala Leu Ala Gly Asp Pro Lys Asn Ser Asp Asp 
85 90 95 

ACT CCA CTT CAT TTA TTC TAT CAA AAA GTC GGT GAT ACA TCG ATT GAC 
Thr Pro Leu His Leu Phe Tyr Gin Lys Val Gly Asp Thr Ser He Asp 
100 105 110 115 

AGC TGG AAA AAT GCT GGA AGA GTA TTT GAA GAT ATG GAT AAA TTT GTT 
Ser Trp Lys Asn Ala Gly Arg Val Phe Glu Asp Met Asp Lys Phe Val 
120 125 130 

CCA AAT GAT CCG TAT CTT AAA TAT CAA ACA CAG GAG TGG TCA GGT TCT 
Pro Asn Asp Pro Tyr Leu Lys Tyr Gin Thr Gin Glu Trp Ser Gly Ser 
135 140 145 

GCT ACT TTA ACC AAA GAT GGC CAA GTC CGT TTA TTC TAT ACA GAT TAC 
Ala Thr Leu Thr Lys Asp Gly Gin Val Arg Leu Phe Tyr Thr Asp Tyr 
150 155 160 



TCA GGT AAT CCT GAA GAT GGT GGA ACC GGT GCT GGT AAC CAA ATC ATT 
Ser Gly Asn Pro Glu Asp Gly Gly Thr Gly Ala Gly Asn Gin He He 



528 



TCA ACT GCT CAA GTA AAC TTA TCC 
Ser Thr Ala Gin Val Asn Leu Ser 
180 185 

GTC GAT GGA GTA TCT GAT CAT AAA 
Val Asp Gly Val Ser Asp His Lys 
200 



CAG CCG GAT GCA GCT ACA CTT AAA 576 
Gin Pro Asp Ala Ala Thr Leu Lys 
190 195 

TCT GTC TTT GAT GGC GGA GAC GGT 624 
Ser Val Phe Asp Gly Gly Asp Gly 
205 210 



ACA GTT TAT CAA AAT ATT CAG CAA TTT ATC GAT GAA GGC AAG TGG ATT 672 
Thr Val Tyr Gin Asn lie Gin Gin Phe lie Asp Glu Gly Lys Trp He 
215 220 225 

TCA GGT GAT AAC CAT ACT TTA AGA GAC CCT CAC TAT GTT GAA GAT AAG 72 0 
Ser Gly Asp Asn His Thr Leu Arg Asp Pro His Tyr Val Glu Asp Lys 
230 235 240 

GGC CAT AAA TAT CTT GTC TTT GAA GCG AAT ACT GGA ACA ACA GAT GGT 768 
Gly His Lys Tyr Leu Val Phe Glu Ala Asn Thr Gly Thr Thr Asp Gly 
245 250 255 

TAT CAA GGC GAT CAG TCT TTC AAT AAT AAA GCT TAC TAT GGC GGA AGT 816 
Tyr Gin Gly Asp Gin Ser Phe Asn Asn Lys Ala Tyr Tyr Gly Gly Ser 
260 265 270 275 

GAC GTC TTC TTC CAG AAT GAA AAA AAT AAA CTG CTT CAA AGT CCT AAA 864 
Asp Val Phe Phe Gin Asn Glu Lys Asn Lys Leu Leu Gin Ser Pro Lys 
280 285 290 

AAA CAA ATT GCT TCT TTA GCG AAT GGT GCA TTA GGC ATT GTT GAA TTG 912 
Lys Gin He Ala Ser Leu Ala Asn Gly Ala Leu Gly He Val Glu Leu 
295 300 305 

GCC GAT GAC TAT ACA GTG AAA AGT GTT ATG AAA CCA TTA GTC GCA TCA 96 0 
Ala Asp Asp Tyr Thr Val Lys Ser Val Met Lys Pro Leu Val Ala Ser 
310 315 320 

AAC ACA GTA GCA GAT GAA GTC GAA CGC GCC AAT ATA TTT AAA ATG AAT 1008 
Asn Thr Val Ala Asp Glu Val Glu Arg Ala Asn He Phe Lys Met Asn 
325 330 335 

AAT AAA TGG TAT CTA TTC ACG GAT TCA AGA GGA TCC AAA ATG ACG AGT 1056 
Asn Lys Trp Tyr Leu Phe Thr Asp Ser Arg Gly Ser Lys Met Thr Ser 
340 345 350 355 

GAT GGA ATT AAC GAC AAA GAT GTT TAT ATG CTA GGG CCC GGA GGC GAC 1104 
Asp Gly He Asn Asp Lys Asp Val Tyr Met Leu Gly Pro Gly Gly Asp 
360 365 370 

TCC TTA AAT GGC CCA CAC AAC CCG ATA AAT GAA ACT GGA CTT GTA TTG 1152 
Ser Leu Asn Gly Pro His Asn Pro He Asn Glu Thr Gly Leu Val Leu 
375 380 385 

AAC ATG AAT CTT GAC CCT GCT GAT CTC ACA CAC ACT TAC TCT CAT TGC 12 00 
Asn Met Asn Leu Asp Pro Ala Asp Leu Thr His Thr Tyr Ser His Cys 
390 395 400 

GGT ATC CCG CAC CCT GAA GGT AAT AAT GTG GTA CTC ACA AGT TAT ATG 124 8 
Gly He Pro His Pro Glu Gly Asn Asn Val Val Leu Thr Ser Tyr Met 
405 410 415 

ACG AAT AGA GGC TTC TAT CCA GAA CAT CAC TCT CAC CTG CGG GAC AAG 12 96 
Thr Asn Arg Gly Phe Tyr Pro Glu His His Ser His Leu Arg Asp Lys 
420 425 430 435 



CTT GGG GTT AAT ATT AAA GGG TCT GAC ACA TCT GGA GGA GAA AAT AGT 134 4 
Leu Gly Val Asn He Lys Gly Ser Asp Thr Ser Gly Gly Girt Asn Ser 
440 445 450 



TCC GGA CAA GGA CAA TTC CCA 
Ser Gly Gin Gly Gin Phe Pro 
455 



1365 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2408 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus sp . 

(B) INDIVIDUAL ISOLATE: V23 0 (PERM BP- 5 054) 



( ix } FEATURE : 

(A) NAME /KEY : 5'UTR 

(B) LOCATION: 1..360 

(C) IDENTIFICATION METHOD: E 



(A) NAME /KEY : signal peptide 

<B) LOCATION: 361.. 456 

(C) IDENTIFICATION METHOD: S 

(A) NAME /KEY : mat peptide 

(B) LOCATION: 457.. 1821 

(C) IDENTIFICATION METHOD: S 

(A) NAME /KEY : 3'UTR 

(B) LOCATION: 1822.. 2408 

(C) IDENTIFICATION METHOD: E 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



CGGGGAAAAT ACT AG AT TCC AATTGGCCAG ACTTCCCAGT TGGTGTAAGA GAAGAGTTCG 60 

GACTGCCAAT GCAGCTGTGC GTAAGAAAAC AGCTTACTCA TGAGCAATTA CTAGAAGAAT 120 

TTCAAAAGTC CTGGGATAAG GCCAAGTCCA CTTTGAAATA AACTTTTCAG CCTCTGTGTG 18 0 

GGGGCTTTTT TGTTTTTATT TATTTCAACT GCAAGTGGTC CATCCCCTAT ATCAATTTAA 24 0 

GACGAAATTC TAATCAATCC ATGCCATCCC CAATAAACTC GTCCTCCTCT ATACTTTTAA 30 0 

TTAATAAGAA ACTATCAAGA GCTTTCTTAT CAAATTCATA CATATCCAAG GAGGGAGACG 360 

ATG AAC TTC AAA AGA TTG GCG AAA AAA GCA GCT GCC GTA ACC TTC AGG 408 
Met Asn Phe Lys Arg Leu Ala Lys Lys Ala Ala Ala Val Thr Phe Arg 
-30 -25 -20 



ACT GCT ATA TTA GTA GGA GCG GAC GGA CCG CAT ATT TTT GCG CAG CAA 
Thr Ala lie Leu Val Gly Ala Asp Gly Pro His lie Phe Ala Gin Gin 
-15 -10 -5 



ATG AAC AGC GGG GAC TAC AAG GAA GAC TAT GGT TTT GCC CAT ATT ACA 504 
Met Asn Ser Gly Asp Tyr Lys Glu Asp Tyr Gly Phe Ala His lie Thr 
15 10 15 



CGC GCT GAC ATG CTA AAA ATT CCA GGA CAA CAA AAC AGT CCT CAA TTT 
Arg Ala Asp Met Leu Lys lie Pro Gly Gin Gin Asn Ser Pro Gin Phe 



AAA GTG CCT CAA TTC AAT GCA TCA GCA ATC AAA AAC ATT GAT TCG GCA 
Lys Val Pro Gin Phe Asn Ala Ser Ala lie Lys Asn lie Asp Ser Ala 



AAA GGG TAT GAT AAG TCA GGC AAC TTA ATA GAT TTA GAT GTA TGG GAT 
Lys Gly Tyr Asp Lys Ser Gly Asn Leu lie Asp Leu Asp Val Trp Asp 
50 55 60 



648 



AGC TGG CCA CTG CAA AAC GCT GAT GGT ACT GCG GCA AAT TAT CAT GGA 
Ser Trp Pro Leu Gin Asn Ala Asp Gly Thr Ala Ala Asn Tyr His Gly 
65 70 75 80 

TAT CAC ATC GTC TCC GCT TTA GCA GGT GAC CCA AAA AAC AGT GAT GAT 
Tyr His lie Val Ser Ala Leu Ala Gly Asp Pro Lys Asn Ser Asp Asp 
85 90 95 

ACT CCA CTT CAT TTA TTC TAT CAA AAA GTC GGT GAT ACA TCG ATT GAC 
Thr Pro Leu His Leu Phe Tyr Gin Lys Val Gly Asp Thr Ser lie Asp 
100 105 110 

AGC TGG AAA AAT GCT GGA AGA GTA TTT GAA GAT ATG GAT AAA TTT GTT 
Ser Trp Lys Asn Ala Gly Arg Val Phe Glu Asp Met Asp Lys Phe Val 
115 120 125 

CCA AAT GAT CCG TAT CTT AAA TAT CAA ACA CAG GAG TGG TCA GGT TCT 
Pro Asn Asp Pro Tyr Leu Lys Tyr Gin Thr Gin Glu Trp Ser Gly Ser 
130 135 140 

GCT ACT TTA ACC AAA GAT GGC CAA GTC CGT TTA TTC TAT ACA GAT TAC 
Ala Thr Leu Thr Lys Asp Gly Gin Val Arg Leu Phe Tyr Thr Asp Tyr 
145 150 155 160 

TCA GGT AAT CCT GAA GAT GGT GGA ACC GGT GCT GGT AAC CAA ATC ATT 
Ser Gly Asn Pro Glu Asp Gly Gly Thr Gly Ala Gly Asn Gin lie lie 
165 170 175 

TCA ACT GCT CAA GTA AAC TTA TCC CAG CCG GAT GCA GCT ACA CTT AAA 
Ser Thr Ala Gin Val Asn Leu Ser Gin Pro Asp Ala Ala Thr Leu Lys 
180 185 190 

GTC GAT GGA GTA TCT GAT CAT AAA TCT GTC TTT GAT GGC GGA GAC GGT 
Val Asp Gly Val Ser Asp His Lys Ser Val Phe Asp Gly Gly Asp Gly 
195 200 205 

ACA GTT TAT CAA AAT ATT CAG CAA TTT ATC GAT GAA GGC AAG TGG ATT 
Thr Val Tyr Gin Asn lie Gin Gin Phe lie Asp Glu Gly Lys Trp lie 
210 215 220 

TCA GGT GAT AAC CAT ACT TTA AGA GAC CCT CAC TAT GTT GAA GAT AAG 
Ser Gly Asp Asn His Thr Leu Arg Asp Pro His Tyr Val Glu Asp Lys 
225 230 235 240 

GGC CAT AAA TAT CTT GTC TTT GAA GCG AAT ACT GGA ACA ACA GAT GGT 
Gly His Lys Tyr Leu Val Phe Glu Ala Asn Thr Gly Thr Thr Asp Gly 
245 250 255 

'TAT CAA GGC GAT CAG TCT TTC AAT AAT AAA GCT TAC TAT GGC GGA AGT 
Tyr Gin Gly Asp Gin Ser Phe Asn Asn Lys Ala Tyr Tyr Gly Gly Ser 
260 265 270 

GAC GTC TTC TTC CAG AAT GAA AAA AAT AAA CTG CTT CAA AGT CCT AAA 
Asp Val Phe Phe Gin Asn Glu Lys Asn Lys Leu Leu Gin Ser Pro Lys 
275 280 285 

AAA CAA ATT GCT TCT TTA GCG AAT GGT GCA TTA GGC ATT GTT GAA TTG 
Lys Gin lie Ala Ser Leu Ala Asn Gly Ala Leu Gly lie Val Glu Leu 
290 295 300 

GCC GAT GAC TAT ACA GTG AAA AGT GTT ATG AAA CCA TTA GTC GCA TCA 
Ala Asp Asp Tyr Thr Val Lys Ser Val Met Lys Pro Leu Val Ala Ser 
305 310 315 320 

AAC ACA GTA GCA GAT GAA GTC GAA CGC GCC AAT ATA TTT AAA ATG AAT 
Asn Thr Val Ala Asp Glu Val Glu Arg Ala Asn lie Phe Lys Met Asn 
325 330 335 
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AAT AAA TGG TAT CTA TTC ACG GAT TCA AGA GGA TCC AAA ATG ACG AGT 
Asn Lys Trp Tyr Leu Phe Thr Asp Ser Arg Gly Ser Lys Met Thr Ser 
340 345 350 



1512 



GAT GGA ATT AAC GAC AAA GAT GTT TAT ATG CTA GGG CCC GGA GGC GAC 1560 
Asp Gly lie Asn Asp Lys Asp Val Tyr Met Leu Gly Pro Gly Gly Asp 
355 360 365 

TCC TTA AAT GGC CCA CAC AAC CCG ATA AAT GAA ACT GGA CTT GTA TTG 1608 
Ser Leu Asn Gly Pro His Asn Pro lie Asn Glu Thr Gly Leu Val Leu 
370 375 380 

AAC ATG AAT CTT GAC CCT GCT GAT CTC ACA CAC ACT TAC TCT CAT TGC 1656 
Asn Met Asn Leu Asp Pro Ala Asp Leu Thr His Thr Tyr Ser His Cys 
385 390 395 400 

GGT ATC CCG CAC CCT GAA GGT AAT AAT GTG GTA CTC ACA AGT TAT ATG 1704 
Gly lie Pro His Pro Glu Gly Asn Asn Val Val Leu Thr Ser Tyr Met 
405 410 415 

ACG AAT AGA GGC TTC TAT CCA GAA CAT CAC TCT CAC CTG CGG GAC AAG 1752 
Thr Asn Arg Gly Phe Tyr Pro Glu His His Ser His Leu Arg Asp Lys 
420 425 430 

CTT GGG GTT AAT ATT AAA GGG TCT GAC ACA TCT GGA GGA GAA AAT AGT 1800 
Leu Gly Val Asn He Lys Gly Ser Asp Thr Ser Gly Gly Glu Asn Ser 
435 440 445 

TCC GGA CAA GGA CAA TTC CCA TA GCGATTATCT CCCAATAAAA AGAAATGTCA 1853 
Ser Gly Gin Gly Gin Phe Pro 
450 455 

CTGGCAAGAT CCTACCTTTT CCCCAACCTT TTTTTTTAAA ATAAAGGGTT TTGTACCACC 1913 
TTTAGAAGAA AAAAGAATCC TTGGCCCGGG CCAATTACCC ATTGCCGAAG GAGCTACCCA 1973 
ATAAAAAAGA AATTTTCCCT TTTTAGGGGG GGAGTCCTTT TTTTCTATCT TGGGGTGGGG 2033 
ATTGTTGGCC CCCACCAGGG GACCTTTATT AATTTATAGC ATGAATCTGG CGATTTTGCC 2093 
TGGCCTACTT ATTATCCAAG CCGCCCAGCC AAAAATATTA AACCAAGGAA TCCTAAGAGT 2153 
AGGCGGTACA GTTTACCAAC ACCTGCTGGA GGG GATGGAT ATCGCCCTCA ATACCACTTT 2213 
TCTGTTCCCG ACAAATGGAA AAATGATCCG CAGAGGCCCA TCTTTTTTGG GGGAAGTATC 22 73 
ATTACTATTA CCTTTACAAC AAAGACTATC CAGATGGAAA TGGTACGGAA TGGCGGCATG 2333 
CAACGTCCGA AGATTTATTG CATTGGACGG ACGAAGGGAT TGCCATCCCG AAGTATAC C A 23 93 
ATAAAAATGG TGATC 2408 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GAYTAYAARG ARGAYTAYGG NTTYGC 
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